3574 results found.
Written
Treebank,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
Size:
None Production Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:Neural Reranking for Dependency Parsing: An Evaluation
-
Paper track:Long/Syntax: Tagging, Chunking and Parsing
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Bich-Ngoc Do | Penn Treebank | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
BSD
Size:
419 entries Production Status:
Newly created-finished
Use:
Anaphora, Coreference
-
Paper title:Toward Gender-Inclusive Coreference Resolution
-
Paper track:Long/Ethics and NLP
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yang Trista Cao | Maybe Ambiguous Pronouns (MAP) Dataset | /N |
Documentation:
An datasheet in English is published along with the dataset
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
BSD
Size:
95 documents OtherProduction Status:
Newly created-finished
Use:
Anaphora, Coreference
-
Paper title:Toward Gender-Inclusive Coreference Resolution
-
Paper track:Long/Ethics and NLP
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yang Trista Cao | Gender Inclusive Coreference (GICoref) Dataset | /N |
Documentation:
A datasheet in English is published along with the dataset
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None Production Status:
Newly created-finished
Use:
Machine Learning
-
Paper title:Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words?
-
Paper track:Long/Interpretability and Analysis of Models for NLP
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Cansu Sen | YELP-HAT Dataset | /N |
Documentation:
None
Not Applicable
Contextualsed word embeddings,
Language Type:
Monolingual
Languages:
Ancient Arabic Basque Bokmål Bulgarian Catalan Chinese Church Croatian Czech Danish Dutch English Estonian Finnish French Galician German Greek Hebrew Hindi Hungarian Indonesian Irish Italian Japanese Korean Latin Latvian Norwegian Nynorsk Old Persian Polish Portuguese Romanian Russian Simplified Chinese Slavonic Slovak Slovene Spanish Swedish Turkish Ukrainian Urdu Uyghur Vietnamese
Availability:
Freely Available
License:
none
Size:
18.4 GByte Production Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:Treebank Embedding Vectors for Out-of-domain Dependency Parsing
-
Paper track:Short/Syntax: Tagging, Chunking and Parsing
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Joachim Wagner | Elmo For Many Languages | /N |
Documentation:
https://www.aclweb.org/anthology/K18-2005/
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC-BY-NC 4.0
Size:
23590 sentences Production Status:
Newly created-finished
Use:
Natural Language Generation
-
Paper title:{ASSET}: {A} Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations
-
Paper track:Long/Resources and Evaluation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Fernando Alva-Manchego | ASSET | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
60.2 MByte Production Status:
Existing-used
Use:
Text Mining
-
Paper title:Hierarchy-Aware Global Model for Hierarchical Text Classification
-
Paper track:Long/Information Retrieval and Text Mining
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jie Zhou | Web of Science | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
1.8 million articles OtherProduction Status:
Existing-used
Use:
Text Mining
-
Paper title:Hierarchy-Aware Global Model for Hierarchical Text Classification
-
Paper track:Long/Information Retrieval and Text Mining
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jie Zhou | The New York Times Annotated Corpus | /N |
Documentation:
https://catalog.ldc.upenn.edu/docs/LDC2008T19/
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
2.5 GByte Production Status:
Existing-used
Use:
Text Mining
-
Paper title:Hierarchy-Aware Global Model for Hierarchical Text Classification
-
Paper track:Long/Information Retrieval and Text Mining
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jie Zhou | Reuters RCV1 | /N |
Documentation:
https://trec.nist.gov/data/reuters/reuters.html
Written
Corpus,
Language Type:
Multilingual
Languages:
Chinese Czech English Finnish German Latvian Romanian Russian Turkish
Availability:
Freely Available
License:
Size:
3.9 MByte Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Automatic Machine Translation Evaluation using Source Language Inputs and Cross-lingual Language Model
-
Paper track:Short/Resources and Evaluation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kosuke Takahashi | WMT18 metrics shared task data | /N |
Documentation:
None




